#IT326 Project

Description of the dataset:

This dataset, obtained from vgchartz.com, provides a valuable resource for exploring the dynamics between gaming platforms and genres in the top 100 global video games. It enables us to analyze the platforms that are influencing worldwide sales, identify the most prosperous genres in various global regions, and track the evolving trends in both platform preference and genre popularity over time.

Our goal:

Our goal from studying this dataset is to utilize classification and clustering techniques on the input data to make predictions about the popularity of upcoming games.

Attributes description:

This dataset has 11 attributes and 16599 objects.

Rank: Ranking of the game based on global sales.

Name: Name of the game.

Platform: Platform the game was released on.

Year: Year the game was released.

Genre: Genre of the game.

Publisher: Publisher of the game.

NA_Sales: Sales of the game in North America.

EU_Sales: Sales of the game in Europe.

JP_Sales: Sales of the game in Japan.

Other_Sales: Sales of the game in other regions.

Global_Sales: Total sales of the game worldwide.

Class label:

Popular’ is our class label, we will use Global_Sales attribute to predict whether a game will sell 1000000 or more globally. Our task of data mining is regression.

dataset=read.csv("vgsales.csv")
Warning: cannot open file 'vgsales.csv': No such file or directoryError in file(file, "rt") : cannot open the connection

Importing our dataset.

library(outliers) 
library(dplyr)

Attaching package: ‘dplyr’

The following objects are masked from ‘package:stats’:

    filter, lag

The following objects are masked from ‘package:base’:

    intersect, setdiff, setequal, union
library(Hmisc)
Registered S3 methods overwritten by 'htmltools':
  method               from         
  print.html           tools:rstudio
  print.shiny.tag      tools:rstudio
  print.shiny.tag.list tools:rstudio
Registered S3 method overwritten by 'htmlwidgets':
  method           from         
  print.htmlwidget tools:rstudio
Registered S3 method overwritten by 'data.table':
  method           from
  print.data.table     

Attaching package: ‘Hmisc’

The following objects are masked from ‘package:dplyr’:

    src, summarize

The following objects are masked from ‘package:base’:

    format.pval, units
library(ggplot2)
library(cowplot)
library(mlbench)
library(caret)
Loading required package: lattice
library(faux)

************
Welcome to faux. For support and examples visit:
https://debruine.github.io/faux/
- Get and set global package options with: faux_options()
************
library(DataExplorer)
library(randomForest)
randomForest 4.7-1.1
Type rfNews() to see new features/changes/bug fixes.

Attaching package: ‘randomForest’

The following object is masked from ‘package:ggplot2’:

    margin

The following object is masked from ‘package:dplyr’:

    combine

The following object is masked from ‘package:outliers’:

    outlier

loading libraries needed for our data mining tasks.

nrow(dataset)
[1] 16598
ncol(dataset)
[1] 11
dim(dataset)
[1] 16598    11
names(dataset)
 [1] "Rank"         "Name"         "Platform"     "Year"         "Genre"       
 [6] "Publisher"    "NA_Sales"     "EU_Sales"     "JP_Sales"     "Other_Sales" 
[11] "Global_Sales"

General info about our dataset including number of rows and columns, and cheking dimensionality and coulumns names.

str(dataset)
'data.frame':   16598 obs. of  11 variables:
 $ Rank        : int  1 2 3 4 5 6 7 8 9 10 ...
 $ Name        : chr  "Wii Sports" "Super Mario Bros." "Mario Kart Wii" "Wii Sports Resort" ...
 $ Platform    : chr  "Wii" "NES" "Wii" "Wii" ...
 $ Year        : chr  "2006" "1985" "2008" "2009" ...
 $ Genre       : chr  "Sports" "Platform" "Racing" "Sports" ...
 $ Publisher   : chr  "Nintendo" "Nintendo" "Nintendo" "Nintendo" ...
 $ NA_Sales    : num  41.5 29.1 15.8 15.8 11.3 ...
 $ EU_Sales    : num  29.02 3.58 12.88 11.01 8.89 ...
 $ JP_Sales    : num  3.77 6.81 3.79 3.28 10.22 ...
 $ Other_Sales : num  8.46 0.77 3.31 2.96 1 0.58 2.9 2.85 2.26 0.47 ...
 $ Global_Sales: num  82.7 40.2 35.8 33 31.4 ...

Dataset structure including number of coulums and rows, attribute types.

head(dataset, 10)

sample of raw dataset(first 10 rows).

tail(dataset, 10)

sample of raw dataset(last 10 rows).

summary(dataset)
      Rank           Name             Platform             Year              Genre          
 Min.   :    1   Length:16598       Length:16598       Length:16598       Length:16598      
 1st Qu.: 4151   Class :character   Class :character   Class :character   Class :character  
 Median : 8300   Mode  :character   Mode  :character   Mode  :character   Mode  :character  
 Mean   : 8301                                                                              
 3rd Qu.:12450                                                                              
 Max.   :16600                                                                              
  Publisher            NA_Sales          EU_Sales          JP_Sales       
 Length:16598       Min.   : 0.0000   Min.   : 0.0000   Min.   : 0.00000  
 Class :character   1st Qu.: 0.0000   1st Qu.: 0.0000   1st Qu.: 0.00000  
 Mode  :character   Median : 0.0800   Median : 0.0200   Median : 0.00000  
                    Mean   : 0.2647   Mean   : 0.1467   Mean   : 0.07778  
                    3rd Qu.: 0.2400   3rd Qu.: 0.1100   3rd Qu.: 0.04000  
                    Max.   :41.4900   Max.   :29.0200   Max.   :10.22000  
  Other_Sales        Global_Sales    
 Min.   : 0.00000   Min.   : 0.0100  
 1st Qu.: 0.00000   1st Qu.: 0.0600  
 Median : 0.01000   Median : 0.1700  
 Mean   : 0.04806   Mean   : 0.5374  
 3rd Qu.: 0.04000   3rd Qu.: 0.4700  
 Max.   :10.57000   Max.   :82.7400  

summary of our dataset.

var(dataset$NA_Sales)
[1] 0.6669712
var(dataset$EU_Sales)
[1] 0.2553799
var(dataset$JP_Sales)
[1] 0.0956607
var(dataset$Other_Sales)
[1] 0.03556559
var(dataset$Global_Sales)
[1] 2.418112

variance of numeric data

Graphs:

dataset2 <- dataset %>% sample_n(50)
tab <- dataset2$Platform %>% table()
precentages <- tab %>% prop.table() %>% round(3) * 100 
txt <- paste0(names(tab), '\n', precentages, '%') 

pie(tab, labels=txt , main = "Pie chart of Platform") 

We notice from the pie chart of platform attribute that releasing a game for PS users will increase the popularity of the game since it is the most common platform among gamers.

# coloring barplot and adding text
tab<-dataset$Genre %>% table() 

precentages<-tab %>% prop.table() %>% round(3)*100 

txt<-paste0(names(tab), '\n',precentages,'%') 

bb <- dataset$Genre %>% table() %>% barplot(axisnames=F, main = "Barplot for Popular genres ",ylab='count',col=c('pink','blue','lightblue','green','lightgreen','red','orange','red','grey','yellow','azure','olivedrab')) 

text(bb,tab/2,labels=txt,cex=1.5) 

In terms of genre, action games are the most popular, followed by sports and music games. It is safe to assume that a high number of genres of this nature exist due to their popularity and sales.

boxplot(dataset$NA_Sales , main="
BoxPlot for NA_Sales")

boxplot(dataset$EU_Sales, main="
 BoxPlot for EU_Sales")

boxplot(dataset$JP_Sales , main="
 BoxPlot for JP_Sales")

boxplot(dataset$Other_Sales , main="
 BoxPlot for Other_Sales") 

The boxplot of the Other-sales attribute indicate that the values are close to each other ,and there is a lot of outliers since the dataset represents the global sales of video games.

boxplot(dataset$Global_Sales , main="BoxPlot for Global_Sales")

The boxplot of the Global-sales attribute indicate that the values are close to each other ,and there is a lot of outliers since the dataset represents the global sales of video games.

qplot(data = dataset, x=Global_Sales,y=Genre,fill=I("yellow"),width=0.5 ,geom = "boxplot" , main = "BoxPlots for genre and Global_Sales")
Warning: `qplot()` was deprecated in ggplot2 3.4.0.

dataset$Year %>% table() %>% barplot( main = "Barplot for year")

pairs(~NA_Sales + EU_Sales + JP_Sales + Other_Sales + Global_Sales, data = dataset,
      main = "Sales Scatterplot")

We used Scatterplot to determine the type of correlation we have between the sales; we can see that the majority have positive correlation with each other.

Pre - processing

Varaible transformation

dataset$Rank=as.character(dataset$Rank)

Rank should be char and not numeric,because we will use them as ordinal data.

Null checking

sum(is.na(dataset$Rank))
[1] 0
NullRank<-dataset[dataset$Rank=="N/A",]
NullRank

checking for nulls in Rank (there is no nulls)

sum(is.na(dataset$Name))
NullName<-dataset[dataset$Name=="N/A",]
NullName

checking for nulls in name (there is no nulls)

sum(is.na(dataset$Platform))
[1] 0
NullPlatform<-dataset[dataset$Platform=="N/A",]

checking for nulls in Platform(there is no nulls)

sum(is.na(dataset$year))
[1] 0
NullYear<-dataset[dataset$Year=="N/A",]
NullYear

checking for nulls in year we won’t delete the null and we will leave them as global constant as it is because we want the sales data of them.

sum(is.na(dataset$Other_Sales))
[1] 0
NullOther_Sales<-dataset[dataset$Other_Sales=="N/A",]

There is no null values in the other_sales.

sum(is.na(dataset$Genre))
[1] 0
NullGenre<-dataset[dataset$Genre=="N/A",]
NullGenre

checking for nulls in Genre(there is no nulls)

sum(is.na(dataset$Publisher))
[1] 0
NullPublisher<-dataset[dataset$Publisher=="N/A",]
NullPublisher

checking for nulls in Publisher. we won’t delete the null and we will leave them as global constant as it is because we want the sales data of them.

sum(is.na(dataset$Global_Sales))
[1] 0
NullGlobal_Sales<-dataset[dataset$Global_Saless=="N/A",]

There is no null values in the Global_Sales.

Encoding

dataset$Platform=factor(dataset$Platform,levels=c("2600","3DO","3DS","DC","DS","GB","GBA","GC","GEN","GG","N64","NES","NG","PC","PCFX","PS","PS2","PS3","PS4","PSP","PSV","SAT","SCD","SNES","TG16","Wii","WiiU","WS","X360","XB","XOne"), labels=c(1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31))

Since most machine learning algorithms work with numbers and not with text or categorical variables, this column will be encoded.

dataset$Genre=factor(dataset$Genre,levels=c("Action","Adventure","Fighting","Platform","Puzzle","Racing","Role-Playing","Shooter","Simulation","Sports","Strategy","Misc"),labels=c(1,2,3,4,5,6,7,8,9,10,11,12))

Since most machine learning algorithms work with numbers and not with text or categorical variables, this column will be encoded.

Outliers

outlier of NA_Sales

OutNA_Sales = outlier(dataset$NA_Sales, logical =TRUE)
Error in if (nrow(x) != ncol(x)) stop("x must be a square matrix") : 
  argument is of length zero

outlier of EU_Sales

OutEU_Sales = outlier(dataset$EU_Sales, logical =TRUE)
Error in if (nrow(x) != ncol(x)) stop("x must be a square matrix") : 
  argument is of length zero

outlier of JP_Sales

OutJP_Sales = outlier(dataset$JP_Sales, logical =TRUE)
Error in if (nrow(x) != ncol(x)) stop("x must be a square matrix") : 
  argument is of length zero

outlier of other_sales

OutOS=outlier(dataset$Other_Sales, logical=TRUE)  
Error in if (nrow(x) != ncol(x)) stop("x must be a square matrix") : 
  argument is of length zero

outlier of Global_sales

OutGS=outlier(dataset$Global_Sales, logical=TRUE)  
Error in if (nrow(x) != ncol(x)) stop("x must be a square matrix") : 
  argument is of length zero

Remove outliers

dataset= dataset[-Find_outlier,]
Error: object 'Find_outlier' not found

Normalization

datsetWithoutNormalization<-dataset

dataset before normalization

normalize <- function(x) {return ((x - min(x)) / (max(x) - min(x)))}
dataset$NA_Sales<-normalize(datsetWithoutNormalization$NA_Sales)
dataset$EU_Sales<-normalize(datsetWithoutNormalization$EU_Sales)
dataset$JP_Sales<-normalize(datsetWithoutNormalization$JP_Sales)
dataset$Other_Sales<-normalize(datsetWithoutNormalization$Other_Sales)
dataset$Global_Sales<-normalize(datsetWithoutNormalization$Global_Sales)

min-max normalization we will use the min-max normalization; it’s better for visualization.

Feautre selection

Our class label (popular) refers to Global_Sales. Other sales regions will be evaluated based on their importance to (global_sales) column. and those that are less important will be deleted from the dataset. use roc_curve area as score

roc_imp <- filterVarImp(x = dataset[,7:10], y = dataset$Global_Sales)

sort the score in decreasing order

roc_imp <- data.frame(cbind(variable = rownames(roc_imp), score = roc_imp[,1]))
roc_imp$score <- as.double(roc_imp$score)
roc_imp[order(roc_imp$score,decreasing = TRUE),]

we will rmove the (JP_Sales) because it is of low importance to our class_label(Global_Sales)

dataset<- dataset[,-9]

#Dataset after pre-processing

LS0tDQpvdXRwdXQ6IGh0bWxfbm90ZWJvb2sNCi0tLQ0KI0lUMzI2IFByb2plY3QNCg0KDQoNCg0KIyBEZXNjcmlwdGlvbiBvZiB0aGUgZGF0YXNldDoNCg0KVGhpcyBkYXRhc2V0LCBvYnRhaW5lZCBmcm9tIHZnY2hhcnR6LmNvbSwgcHJvdmlkZXMgYSB2YWx1YWJsZSByZXNvdXJjZSBmb3IgZXhwbG9yaW5nIHRoZSBkeW5hbWljcyBiZXR3ZWVuIGdhbWluZyBwbGF0Zm9ybXMgYW5kIGdlbnJlcyBpbiB0aGUgdG9wIDEwMCBnbG9iYWwgdmlkZW8gZ2FtZXMuIEl0IGVuYWJsZXMgdXMgdG8gYW5hbHl6ZSB0aGUgcGxhdGZvcm1zIHRoYXQgYXJlIGluZmx1ZW5jaW5nIHdvcmxkd2lkZSBzYWxlcywgaWRlbnRpZnkgdGhlIG1vc3QgcHJvc3Blcm91cyBnZW5yZXMgaW4gdmFyaW91cyBnbG9iYWwgcmVnaW9ucywgYW5kIHRyYWNrIHRoZSBldm9sdmluZyB0cmVuZHMgaW4gYm90aCBwbGF0Zm9ybSBwcmVmZXJlbmNlIGFuZCBnZW5yZSBwb3B1bGFyaXR5IG92ZXIgdGltZS4gDQoNCiMgU291cmNlIGFuZCBsaW5rOg0KU291cmNlOiB2Z2NoYXJ0ei5jb20NCg0KVVJMIGxpbms6IGh0dHBzOi8vd3d3LmthZ2dsZS5jb20vZGF0YXNldHMvZ3JlZ29ydXQvdmlkZW9nYW1lc2FsZXMNCg0KIyBPdXIgZ29hbDoNCg0KT3VyIGdvYWwgIGZyb20gc3R1ZHlpbmcgdGhpcyBkYXRhc2V0IGlzIHRvIHV0aWxpemUgY2xhc3NpZmljYXRpb24gYW5kIGNsdXN0ZXJpbmcgdGVjaG5pcXVlcyBvbiB0aGUgaW5wdXQgZGF0YSB0byBtYWtlIHByZWRpY3Rpb25zIGFib3V0IHRoZSBwb3B1bGFyaXR5IG9mIHVwY29taW5nIGdhbWVzLg0KDQoNCg0KIyBBdHRyaWJ1dGVzIGRlc2NyaXB0aW9uOg0KDQpUaGlzIGRhdGFzZXQgaGFzIDExIGF0dHJpYnV0ZXMgYW5kIDE2NTk5IG9iamVjdHMuDQoNClJhbms6IFJhbmtpbmcgb2YgdGhlIGdhbWUgYmFzZWQgb24gZ2xvYmFsIHNhbGVzLiANCg0KTmFtZTogTmFtZSBvZiB0aGUgZ2FtZS4gDQoNClBsYXRmb3JtOiBQbGF0Zm9ybSB0aGUgZ2FtZSB3YXMgcmVsZWFzZWQgb24uDQoNClllYXI6IFllYXIgdGhlIGdhbWUgd2FzIHJlbGVhc2VkLiANCg0KR2VucmU6IEdlbnJlIG9mIHRoZSBnYW1lLg0KDQpQdWJsaXNoZXI6IFB1Ymxpc2hlciBvZiB0aGUgZ2FtZS4gDQoNCk5BX1NhbGVzOiBTYWxlcyBvZiB0aGUgZ2FtZSBpbiBOb3J0aCBBbWVyaWNhLiANCg0KRVVfU2FsZXM6IFNhbGVzIG9mIHRoZSBnYW1lIGluIEV1cm9wZS4gDQoNCkpQX1NhbGVzOiBTYWxlcyBvZiB0aGUgZ2FtZSBpbiBKYXBhbi4gDQoNCk90aGVyX1NhbGVzOiBTYWxlcyBvZiB0aGUgZ2FtZSBpbiBvdGhlciByZWdpb25zLg0KDQpHbG9iYWxfU2FsZXM6IFRvdGFsIHNhbGVzIG9mIHRoZSBnYW1lIHdvcmxkd2lkZS4NCg0KDQojIENsYXNzIGxhYmVsOg0KDQpQb3B1bGFyJyBpcyBvdXIgY2xhc3MgbGFiZWwsIHdlIHdpbGwgdXNlIEdsb2JhbF9TYWxlcyBhdHRyaWJ1dGUgdG8gcHJlZGljdCB3aGV0aGVyIGEgZ2FtZSB3aWxsIHNlbGwgMTAwMDAwMCBvciBtb3JlIGdsb2JhbGx5LiBPdXIgdGFzayBvZiBkYXRhIG1pbmluZyBpcyByZWdyZXNzaW9uLg0KDQoNCg0KDQpgYGB7cn0NCmRhdGFzZXQ9cmVhZC5jc3YoInZnc2FsZXMuY3N2IikNCmBgYA0KSW1wb3J0aW5nIG91ciBkYXRhc2V0Lg0KDQoNCmBgYHtyfQ0KbGlicmFyeShvdXRsaWVycykgDQpsaWJyYXJ5KGRwbHlyKQ0KbGlicmFyeShIbWlzYykNCmxpYnJhcnkoZ2dwbG90MikNCmxpYnJhcnkoY293cGxvdCkNCmxpYnJhcnkobWxiZW5jaCkNCmxpYnJhcnkoY2FyZXQpDQpsaWJyYXJ5KGZhdXgpDQpsaWJyYXJ5KERhdGFFeHBsb3JlcikNCmxpYnJhcnkocmFuZG9tRm9yZXN0KQ0Kb3B0aW9ucyhtYXgucHJpbnQ9OTk5OTk5OSkNCmBgYA0KDQpsb2FkaW5nIGxpYnJhcmllcyBuZWVkZWQgZm9yIG91ciBkYXRhIG1pbmluZyB0YXNrcy4NCg0KDQpgYGB7cn0NCm5yb3coZGF0YXNldCkNCm5jb2woZGF0YXNldCkNCmRpbShkYXRhc2V0KQ0KbmFtZXMoZGF0YXNldCkNCmBgYA0KR2VuZXJhbCBpbmZvIGFib3V0IG91ciBkYXRhc2V0IGluY2x1ZGluZyAgbnVtYmVyIG9mIHJvd3MgYW5kIGNvbHVtbnMsIGFuZCBjaGVraW5nIGRpbWVuc2lvbmFsaXR5IGFuZCBjb3VsdW1ucyBuYW1lcy4NCg0KYGBge3J9DQpzdHIoZGF0YXNldCkNCmBgYA0KRGF0YXNldCBzdHJ1Y3R1cmUgaW5jbHVkaW5nIG51bWJlciBvZiBjb3VsdW1zIGFuZCByb3dzLCBhdHRyaWJ1dGUgdHlwZXMuIA0KDQpgYGB7cn0NCmhlYWQoZGF0YXNldCwgMTApDQpgYGANCnNhbXBsZSBvZiByYXcgZGF0YXNldChmaXJzdCAxMCByb3dzKS4NCg0KYGBge3J9DQp0YWlsKGRhdGFzZXQsIDEwKQ0KYGBgDQpzYW1wbGUgb2YgcmF3IGRhdGFzZXQobGFzdCAxMCByb3dzKS4NCg0KYGBge3J9DQpzdW1tYXJ5KGRhdGFzZXQpDQpgYGANCnN1bW1hcnkgb2Ygb3VyIGRhdGFzZXQuDQoNCmBgYHtyfQ0KdmFyKGRhdGFzZXQkTkFfU2FsZXMpDQp2YXIoZGF0YXNldCRFVV9TYWxlcykNCnZhcihkYXRhc2V0JEpQX1NhbGVzKQ0KdmFyKGRhdGFzZXQkT3RoZXJfU2FsZXMpDQp2YXIoZGF0YXNldCRHbG9iYWxfU2FsZXMpDQpgYGANCnZhcmlhbmNlIG9mIG51bWVyaWMgZGF0YQ0KDQojIEdyYXBoczoNCg0KYGBge3J9DQpkYXRhc2V0MiA8LSBkYXRhc2V0ICU+JSBzYW1wbGVfbig1MCkNCnRhYiA8LSBkYXRhc2V0MiRQbGF0Zm9ybSAlPiUgdGFibGUoKQ0KcHJlY2VudGFnZXMgPC0gdGFiICU+JSBwcm9wLnRhYmxlKCkgJT4lIHJvdW5kKDMpICogMTAwIA0KdHh0IDwtIHBhc3RlMChuYW1lcyh0YWIpLCAnXG4nLCBwcmVjZW50YWdlcywgJyUnKSANCg0KcGllKHRhYiwgbGFiZWxzPXR4dCAsIG1haW4gPSAiUGllIGNoYXJ0IG9mIFBsYXRmb3JtIikgDQoNCmBgYA0KDQpXZSBub3RpY2UgZnJvbSB0aGUgcGllIGNoYXJ0IG9mIHBsYXRmb3JtIGF0dHJpYnV0ZSB0aGF0IHJlbGVhc2luZyBhIGdhbWUgZm9yIFBTIHVzZXJzIHdpbGwgaW5jcmVhc2UgdGhlIHBvcHVsYXJpdHkgb2YgdGhlIGdhbWUgc2luY2UgaXQgaXMgdGhlIG1vc3QgY29tbW9uIHBsYXRmb3JtIGFtb25nIGdhbWVycy4gDQoNCg0KDQoNCg0KYGBge3J9DQojIGNvbG9yaW5nIGJhcnBsb3QgYW5kIGFkZGluZyB0ZXh0DQp0YWI8LWRhdGFzZXQkR2VucmUgJT4lIHRhYmxlKCkgDQoNCnByZWNlbnRhZ2VzPC10YWIgJT4lIHByb3AudGFibGUoKSAlPiUgcm91bmQoMykqMTAwIA0KDQp0eHQ8LXBhc3RlMChuYW1lcyh0YWIpLCAnXG4nLHByZWNlbnRhZ2VzLCclJykgDQoNCmJiIDwtIGRhdGFzZXQkR2VucmUgJT4lIHRhYmxlKCkgJT4lIGJhcnBsb3QoYXhpc25hbWVzPUYsIG1haW4gPSAiQmFycGxvdCBmb3IgUG9wdWxhciBnZW5yZXMgIix5bGFiPSdjb3VudCcsY29sPWMoJ3BpbmsnLCdibHVlJywnbGlnaHRibHVlJywnZ3JlZW4nLCdsaWdodGdyZWVuJywncmVkJywnb3JhbmdlJywncmVkJywnZ3JleScsJ3llbGxvdycsJ2F6dXJlJywnb2xpdmVkcmFiJykpIA0KDQp0ZXh0KGJiLHRhYi8yLGxhYmVscz10eHQsY2V4PTEuNSkgDQpgYGANCkluIHRlcm1zIG9mIGdlbnJlLCBhY3Rpb24gZ2FtZXMgYXJlIHRoZSBtb3N0IHBvcHVsYXIsIGZvbGxvd2VkIGJ5IHNwb3J0cyBhbmQgbXVzaWMgZ2FtZXMuIEl0IGlzIHNhZmUgdG8gYXNzdW1lIHRoYXQgYSBoaWdoIG51bWJlciBvZiBnZW5yZXMgb2YgdGhpcyBuYXR1cmUgZXhpc3QgZHVlIHRvIHRoZWlyIHBvcHVsYXJpdHkgYW5kIHNhbGVzLg0KDQoNCmBgYHtyfQ0KYm94cGxvdChkYXRhc2V0JE5BX1NhbGVzICwgbWFpbj0iDQpCb3hQbG90IGZvciBOQV9TYWxlcyIpDQpgYGANCg0KYGBge3J9DQpib3hwbG90KGRhdGFzZXQkRVVfU2FsZXMsIG1haW49Ig0KIEJveFBsb3QgZm9yIEVVX1NhbGVzIikNCmBgYA0KDQpgYGB7cn0NCmJveHBsb3QoZGF0YXNldCRKUF9TYWxlcyAsIG1haW49Ig0KIEJveFBsb3QgZm9yIEpQX1NhbGVzIikNCmBgYA0KDQoNCg0KDQpgYGB7cn0NCmJveHBsb3QoZGF0YXNldCRPdGhlcl9TYWxlcyAsIG1haW49Ig0KIEJveFBsb3QgZm9yIE90aGVyX1NhbGVzIikgDQpgYGAgIA0KDQpUaGUgYm94cGxvdCBvZiB0aGUgT3RoZXItc2FsZXMgYXR0cmlidXRlIGluZGljYXRlIHRoYXQgdGhlIHZhbHVlcyBhcmUgY2xvc2UgdG8gZWFjaCBvdGhlciAsYW5kIHRoZXJlIGlzIGEgbG90IG9mIG91dGxpZXJzIHNpbmNlIHRoZSBkYXRhc2V0IHJlcHJlc2VudHMgdGhlIGdsb2JhbCBzYWxlcyBvZiB2aWRlbyBnYW1lcy4gDQoNCg0KDQoNCmBgYHtyfQ0KYm94cGxvdChkYXRhc2V0JEdsb2JhbF9TYWxlcyAsIG1haW49IkJveFBsb3QgZm9yIEdsb2JhbF9TYWxlcyIpDQoNCmBgYCAgDQpUaGUgYm94cGxvdCBvZiB0aGUgR2xvYmFsLXNhbGVzIGF0dHJpYnV0ZSBpbmRpY2F0ZSB0aGF0IHRoZSB2YWx1ZXMgYXJlIGNsb3NlIHRvIGVhY2ggb3RoZXIgLGFuZCB0aGVyZSBpcyBhIGxvdCBvZiBvdXRsaWVycyBzaW5jZSB0aGUgZGF0YXNldCByZXByZXNlbnRzIHRoZSBnbG9iYWwgc2FsZXMgb2YgdmlkZW8gZ2FtZXMuIA0KDQoNCg0KDQpgYGB7cn0NCnFwbG90KGRhdGEgPSBkYXRhc2V0LCB4PUdsb2JhbF9TYWxlcyx5PUdlbnJlLGZpbGw9SSgieWVsbG93Iiksd2lkdGg9MC41ICxnZW9tID0gImJveHBsb3QiICwgbWFpbiA9ICJCb3hQbG90cyBmb3IgZ2VucmUgYW5kIEdsb2JhbF9TYWxlcyIpDQpgYGANCg0KYGBge3J9DQpkYXRhc2V0JFllYXIgJT4lIHRhYmxlKCkgJT4lIGJhcnBsb3QoIG1haW4gPSAiQmFycGxvdCBmb3IgeWVhciIpDQpgYGANCg0KYGBge3J9DQpwYWlycyh+TkFfU2FsZXMgKyBFVV9TYWxlcyArIEpQX1NhbGVzICsgT3RoZXJfU2FsZXMgKyBHbG9iYWxfU2FsZXMsIGRhdGEgPSBkYXRhc2V0LA0KICAgICAgbWFpbiA9ICJTYWxlcyBTY2F0dGVycGxvdCIpDQpgYGAgICAgDQpXZSB1c2VkIFNjYXR0ZXJwbG90IHRvIGRldGVybWluZSB0aGUgdHlwZSBvZiBjb3JyZWxhdGlvbiB3ZSBoYXZlIGJldHdlZW4gdGhlIHNhbGVzOyB3ZSBjYW4gc2VlIHRoYXQgdGhlIG1ham9yaXR5IGhhdmUgcG9zaXRpdmUgY29ycmVsYXRpb24gd2l0aCBlYWNoIG90aGVyLiANCiANCiANCiAgICAgIA0KIyBQcmUgLSBwcm9jZXNzaW5nDQoNCiMgVmFyYWlibGUgdHJhbnNmb3JtYXRpb24NCmBgYHtyfQ0KZGF0YXNldCRSYW5rPWFzLmNoYXJhY3RlcihkYXRhc2V0JFJhbmspDQpgYGANClJhbmsgc2hvdWxkIGJlIGNoYXIgYW5kIG5vdCBudW1lcmljLGJlY2F1c2Ugd2Ugd2lsbCB1c2UgdGhlbSBhcyBvcmRpbmFsIGRhdGEuDQoNCiMgTnVsbCBjaGVja2luZw0KYGBge3J9DQpzdW0oaXMubmEoZGF0YXNldCRSYW5rKSkNCk51bGxSYW5rPC1kYXRhc2V0W2RhdGFzZXQkUmFuaz09Ik4vQSIsXQ0KTnVsbFJhbmsNCmBgYA0KY2hlY2tpbmcgZm9yIG51bGxzIGluIFJhbmsgKHRoZXJlIGlzIG5vIG51bGxzKQ0KYGBge3J9DQpzdW0oaXMubmEoZGF0YXNldCROYW1lKSkNCk51bGxOYW1lPC1kYXRhc2V0W2RhdGFzZXQkTmFtZT09Ik4vQSIsXQ0KTnVsbE5hbWUNCmBgYA0KDQpjaGVja2luZyBmb3IgbnVsbHMgaW4gbmFtZSAodGhlcmUgaXMgbm8gbnVsbHMpDQoNCmBgYHtyfQ0Kc3VtKGlzLm5hKGRhdGFzZXQkUGxhdGZvcm0pKQ0KTnVsbFBsYXRmb3JtPC1kYXRhc2V0W2RhdGFzZXQkUGxhdGZvcm09PSJOL0EiLF0NCg0KDQpgYGANCmNoZWNraW5nIGZvciBudWxscyBpbiBQbGF0Zm9ybSh0aGVyZSBpcyBubyBudWxscykNCg0KYGBge3J9DQpzdW0oaXMubmEoZGF0YXNldCRZZWFyKSkNCk51bGxZZWFyPC1kYXRhc2V0W2RhdGFzZXQkWWVhcj09Ik4vQSIsXQ0KTnVsbFllYXINCmBgYA0KY2hlY2tpbmcgZm9yIG51bGxzIGluIHllYXINCndlIHdvbid0IGRlbGV0ZSB0aGUgbnVsbCBhbmQgd2Ugd2lsbCBsZWF2ZSB0aGVtIGFzIGdsb2JhbCBjb25zdGFudCBhcyBpdCBpcyBiZWNhdXNlIHdlIHdhbnQgdGhlIHNhbGVzIGRhdGEgb2YgdGhlbS4NCg0KDQpgYGB7cn0NCnN1bShpcy5uYShkYXRhc2V0JE90aGVyX1NhbGVzKSkNCk51bGxPdGhlcl9TYWxlczwtZGF0YXNldFtkYXRhc2V0JE90aGVyX1NhbGVzPT0iTi9BIixdDQoNCg0KYGBgDQpUaGVyZSBpcyBubyBudWxsIHZhbHVlcyBpbiB0aGUgb3RoZXJfc2FsZXMuDQoNCmBgYHtyfQ0Kc3VtKGlzLm5hKGRhdGFzZXQkR2VucmUpKQ0KTnVsbEdlbnJlPC1kYXRhc2V0W2RhdGFzZXQkR2VucmU9PSJOL0EiLF0NCk51bGxHZW5yZQ0KYGBgDQpjaGVja2luZyBmb3IgbnVsbHMgaW4gR2VucmUodGhlcmUgaXMgbm8gbnVsbHMpDQpgYGB7cn0NCnN1bShpcy5uYShkYXRhc2V0JFB1Ymxpc2hlcikpDQpOdWxsUHVibGlzaGVyPC1kYXRhc2V0W2RhdGFzZXQkUHVibGlzaGVyPT0iTi9BIixdDQpOdWxsUHVibGlzaGVyDQpgYGANCmNoZWNraW5nIGZvciBudWxscyBpbiBQdWJsaXNoZXIuDQp3ZSB3b24ndCBkZWxldGUgdGhlIG51bGwgYW5kIHdlIHdpbGwgbGVhdmUgdGhlbSBhcyBnbG9iYWwgY29uc3RhbnQgYXMgaXQgaXMgYmVjYXVzZSB3ZSB3YW50IHRoZSBzYWxlcyBkYXRhIG9mIHRoZW0uDQoNCg0KYGBge3J9DQpzdW0oaXMubmEoZGF0YXNldCRHbG9iYWxfU2FsZXMpKQ0KTnVsbEdsb2JhbF9TYWxlczwtZGF0YXNldFtkYXRhc2V0JEdsb2JhbF9TYWxlc3M9PSJOL0EiLF0NCg0KDQpgYGANClRoZXJlIGlzIG5vIG51bGwgdmFsdWVzIGluIHRoZSBHbG9iYWxfU2FsZXMuDQoNCiMgRW5jb2RpbmcNCmBgYHtyfQ0KZGF0YXNldCRQbGF0Zm9ybT1mYWN0b3IoZGF0YXNldCRQbGF0Zm9ybSxsZXZlbHM9YygiMjYwMCIsIjNETyIsIjNEUyIsIkRDIiwiRFMiLCJHQiIsIkdCQSIsIkdDIiwiR0VOIiwiR0ciLCJONjQiLCJORVMiLCJORyIsIlBDIiwiUENGWCIsIlBTIiwiUFMyIiwiUFMzIiwiUFM0IiwiUFNQIiwiUFNWIiwiU0FUIiwiU0NEIiwiU05FUyIsIlRHMTYiLCJXaWkiLCJXaWlVIiwiV1MiLCJYMzYwIiwiWEIiLCJYT25lIiksIGxhYmVscz1jKDEsMiwzLDQsNSw2LDcsOCw5LDEwLDExLDEyLDEzLDE0LDE1LDE2LDE3LDE4LDE5LDIwLDIxLDIyLDIzLDI0LDI1LDI2LDI3LDI4LDI5LDMwLDMxKSkNCmBgYA0KU2luY2UgbW9zdCBtYWNoaW5lIGxlYXJuaW5nIGFsZ29yaXRobXMgd29yayB3aXRoIG51bWJlcnMgYW5kIG5vdCB3aXRoIHRleHQgb3IgY2F0ZWdvcmljYWwgdmFyaWFibGVzLCB0aGlzIGNvbHVtbiB3aWxsIGJlIGVuY29kZWQuDQoNCmBgYHtyfQ0KZGF0YXNldCRHZW5yZT1mYWN0b3IoZGF0YXNldCRHZW5yZSxsZXZlbHM9YygiQWN0aW9uIiwiQWR2ZW50dXJlIiwiRmlnaHRpbmciLCJQbGF0Zm9ybSIsIlB1enpsZSIsIlJhY2luZyIsIlJvbGUtUGxheWluZyIsIlNob290ZXIiLCJTaW11bGF0aW9uIiwiU3BvcnRzIiwiU3RyYXRlZ3kiLCJNaXNjIiksbGFiZWxzPWMoMSwyLDMsNCw1LDYsNyw4LDksMTAsMTEsMTIpKQ0KYGBgDQpTaW5jZSBtb3N0IG1hY2hpbmUgbGVhcm5pbmcgYWxnb3JpdGhtcyB3b3JrIHdpdGggbnVtYmVycyBhbmQgbm90IHdpdGggdGV4dCBvciBjYXRlZ29yaWNhbCB2YXJpYWJsZXMsIHRoaXMgY29sdW1uIHdpbGwgYmUgZW5jb2RlZC4NCg0KDQojIE91dGxpZXJzDQpvdXRsaWVyIG9mIE5BX1NhbGVzDQpgYGB7cn0NCk91dE5BX1NhbGVzID0gb3V0bGllcihkYXRhc2V0JE5BX1NhbGVzLCBsb2dpY2FsID1UUlVFKQ0Kc3VtKE91dE5BX1NhbGVzKQ0KRmluZF9vdXRsaWVyID0gd2hpY2goT3V0TkFfU2FsZXMgPT1UUlVFLCBhcnIuaW5kID0gVFJVRSkNCk91dE5BX1NhbGVzDQpGaW5kX291dGxpZXINCmBgYA0Kb3V0bGllciBvZiBFVV9TYWxlcw0KYGBge3J9DQpPdXRFVV9TYWxlcyA9IG91dGxpZXIoZGF0YXNldCRFVV9TYWxlcywgbG9naWNhbCA9VFJVRSkNCnN1bShPdXRFVV9TYWxlcykNCkZpbmRfb3V0bGllciA9IHdoaWNoKE91dEVVX1NhbGVzID09VFJVRSwgYXJyLmluZCA9IFRSVUUpDQpPdXRFVV9TYWxlcw0KRmluZF9vdXRsaWVyDQpgYGANCm91dGxpZXIgb2YgSlBfU2FsZXMNCmBgYHtyfQ0KT3V0SlBfU2FsZXMgPSBvdXRsaWVyKGRhdGFzZXQkSlBfU2FsZXMsIGxvZ2ljYWwgPVRSVUUpDQpzdW0oT3V0SlBfU2FsZXMpDQpGaW5kX291dGxpZXIgPSB3aGljaChPdXRKUF9TYWxlcyA9PVRSVUUsIGFyci5pbmQgPSBUUlVFKQ0KT3V0SlBfU2FsZXMNCkZpbmRfb3V0bGllcg0KYGBgDQoNCm91dGxpZXIgb2Ygb3RoZXJfc2FsZXMgDQpgYGB7cn0NCk91dE9TPW91dGxpZXIoZGF0YXNldCRPdGhlcl9TYWxlcywgbG9naWNhbD1UUlVFKSAgDQpzdW0oT3V0T1MpICANCkZpbmRfb3V0bGllcj13aGljaChPdXRPUz09VFJVRSwgYXJyLmluZD1UUlVFKSAgDQpPdXRPUyANCkZpbmRfb3V0bGllciANCg0KYGBgDQoNCg0Kb3V0bGllciBvZiBHbG9iYWxfc2FsZXMgDQoNCmBgYHtyfQ0KT3V0R1M9b3V0bGllcihkYXRhc2V0JEdsb2JhbF9TYWxlcywgbG9naWNhbD1UUlVFKSAgDQpzdW0oT3V0R1MpICANCkZpbmRfb3V0bGllcj13aGljaChPdXRHUz09VFJVRSwgYXJyLmluZD1UUlVFKSAgDQpPdXRHUyANCkZpbmRfb3V0bGllciANCg0KYGBgDQoNCg0KDQojIFJlbW92ZSBvdXRsaWVycyANCmBgYHtyfQ0KZGF0YXNldD0gZGF0YXNldFstRmluZF9vdXRsaWVyLF0NCmBgYA0KDQoNCg0KIyBOb3JtYWxpemF0aW9uDQoNCmBgYHtyfQ0KZGF0c2V0V2l0aG91dE5vcm1hbGl6YXRpb248LWRhdGFzZXQNCmBgYA0KZGF0YXNldCBiZWZvcmUgbm9ybWFsaXphdGlvbg0KDQpgYGB7cn0NCm5vcm1hbGl6ZSA8LSBmdW5jdGlvbih4KSB7cmV0dXJuICgoeCAtIG1pbih4KSkgLyAobWF4KHgpIC0gbWluKHgpKSl9DQpkYXRhc2V0JE5BX1NhbGVzPC1ub3JtYWxpemUoZGF0c2V0V2l0aG91dE5vcm1hbGl6YXRpb24kTkFfU2FsZXMpDQpkYXRhc2V0JEVVX1NhbGVzPC1ub3JtYWxpemUoZGF0c2V0V2l0aG91dE5vcm1hbGl6YXRpb24kRVVfU2FsZXMpDQpkYXRhc2V0JEpQX1NhbGVzPC1ub3JtYWxpemUoZGF0c2V0V2l0aG91dE5vcm1hbGl6YXRpb24kSlBfU2FsZXMpDQpkYXRhc2V0JE90aGVyX1NhbGVzPC1ub3JtYWxpemUoZGF0c2V0V2l0aG91dE5vcm1hbGl6YXRpb24kT3RoZXJfU2FsZXMpDQpkYXRhc2V0JEdsb2JhbF9TYWxlczwtbm9ybWFsaXplKGRhdHNldFdpdGhvdXROb3JtYWxpemF0aW9uJEdsb2JhbF9TYWxlcykNCmBgYA0KbWluLW1heCBub3JtYWxpemF0aW9uDQp3ZSB3aWxsIHVzZSB0aGUgbWluLW1heCBub3JtYWxpemF0aW9uOyBpdCdzIGJldHRlciBmb3IgdmlzdWFsaXphdGlvbi4NCg0KDQojIEZlYXV0cmUgc2VsZWN0aW9uDQpPdXIgY2xhc3MgbGFiZWwgKHBvcHVsYXIpIHJlZmVycyB0byBHbG9iYWxfU2FsZXMuIE90aGVyIHNhbGVzIHJlZ2lvbnMgd2lsbCBiZSBldmFsdWF0ZWQgYmFzZWQgb24gdGhlaXIgaW1wb3J0YW5jZSB0byAoZ2xvYmFsX3NhbGVzKSBjb2x1bW4uIGFuZCB0aG9zZSB0aGF0IGFyZSBsZXNzIGltcG9ydGFudCB3aWxsIGJlIGRlbGV0ZWQgZnJvbSB0aGUgZGF0YXNldC4NCnVzZSByb2NfY3VydmUgYXJlYSBhcyBzY29yZQ0KYGBge3J9DQpyb2NfaW1wIDwtIGZpbHRlclZhckltcCh4ID0gZGF0YXNldFssNzoxMF0sIHkgPSBkYXRhc2V0JEdsb2JhbF9TYWxlcykNCmBgYA0Kc29ydCB0aGUgc2NvcmUgaW4gZGVjcmVhc2luZyBvcmRlcg0KYGBge3J9DQpyb2NfaW1wIDwtIGRhdGEuZnJhbWUoY2JpbmQodmFyaWFibGUgPSByb3duYW1lcyhyb2NfaW1wKSwgc2NvcmUgPSByb2NfaW1wWywxXSkpDQpyb2NfaW1wJHNjb3JlIDwtIGFzLmRvdWJsZShyb2NfaW1wJHNjb3JlKQ0Kcm9jX2ltcFtvcmRlcihyb2NfaW1wJHNjb3JlLGRlY3JlYXNpbmcgPSBUUlVFKSxdDQpgYGANCndlIHdpbGwgcm1vdmUgdGhlIChKUF9TYWxlcykgYmVjYXVzZSBpdCBpcyBvZiBsb3cgaW1wb3J0YW5jZSB0byBvdXIgY2xhc3NfbGFiZWwoR2xvYmFsX1NhbGVzKQ0KYGBge3J9DQpkYXRhc2V0PC0gZGF0YXNldFssLTldDQpgYGANCg0KI0RhdGFzZXQgYWZ0ZXIgcHJlLXByb2Nlc3NpbmcNCmBgYHtyfQ0KcHJpbnQoZGF0YXNldCkNCmBgYA0KDQo=